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CRYSTALS OF AN ACYL CARRIER PROTEIN SYNTHASE/ACYL CARRIER 

PROTEIN COMPLEX 



This application is a divisional of U.S. Patent Application No. 09/770,834, filed 
January 25, 2001, now U.S. Patent No. 6,684,162, which claims the benefit of U.S. 
Provisional Application No. 60/202,466 filed May 8, 2000, the contents of both of which 
are herewith incorporated by reference. 

FIELD OF THE INVENTION 

The present invention relates to the crystal structure of the ACPS/ACP complex, 
as well as the three-dimensional solution structure of B, subtilis ACP These structures are 
critical for the design and selection of potent and selective agents which interact with 
ACPS and ACP, and particularly, the design of novel antibiotics. 

BACKGROUND OF THE INVENTION 
Acyl Carrier Proteins (ACPs) play important roles in a number of biosynthetic 
pathways that are dependent upon acyl group transfers [1]. They are most often 
associated with the biosynthesis of fatty acids [2,3], but they are also utilized in the 
synthesis of polyketide antibiotics [4,5], non-ribosomal peptides [6,7], and of 
intermediates used in the synthesis of vitamins such as the protein-bound coenzymes, 
lipoic acid [8] and biotin [9]. The ACP in each of these pathways is composed of 80-100 
residues and is either an integrated domain in a larger multi-functional protein (Type I 
synthase complex) or is a structurally independent protein that is part of a non-aggregated 
multi-enzyme system (Type II synthase complex). Type I synthases are found in 
mammals, fungi and certain Mycobacteria while type II ACPs are utilized by plants and 
most bacteria. The Escherichia coli ACP for fatty acid synthesis has been over-expressed 
[10] and purified [11,12], and the solution structure has been solved by NMR 
spectroscopy [13]. The fact that these proteins are essential for the maturation of the 
organism has led to their investigation as targets for the development of new anti- 
microbial agents [14-18]. 

ACPs require post-translational modification for activity. They are converted from 
an inactive apo-form to an active holo-form by the transfer of the 4 f -phosphopantetheinyl 
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(P-pant) moiety of coenzyme A to a conserved serine on the ACP. The B-hydroxy side 
chain of the serine residue serves as a nucleophilic group attacking the pyrophosphate 
linkage of Co A. Evidence now suggests [19] that each synthase that is dependent upon P- 
pant attachment for activation has its own partner enzyme responsible for this attachment. 
5 The post-translational modification of the ACP subunit in the fatty acid synthase 

is performed by holo-[acyl carrier protein] synthase (hereinafter defined as "ACPS"; 
Enzyme Commission No. 2.7.8.7). The best characterized member of the ACPS family is 
the E. coli ACPS [20]. The enzyme produces holo-ACP by transferring the P-pant moiety 
to Ser-36 of the E. coli apo-ACP (SEQ ID NO: 15) in a magnesium dependent reaction 
10 [20] as follows: 

CoA+ apo-ACP — -> holo-ACP + 3\5'-ADP 

ACPS, Mg 2+ 

1 5 The over-expression and purification of the E. coli ACPS has been described [2 1 ] 

and this protein is classified as a member of a new enzyme superfamily, the 
phosphopantetheinyl transferases [19]. Based on the size of the proteins, the P-pant 
transferase superfamily can be roughly divided into two subgroups [22]. Enzymes 
responsible for modifying the peptidyl carrier protein subunits of non-ribosomal peptide 

20 synthetases are good examples for the first subgroup, which are usually -230 amino acids 
in size. The structure of one of this subgroup enzymes, the surfactin synthetase activating 
enzyme Sfp, has been solved recently and it consists of a 2-fold intramolecular 
pseudosymmetry with the CoA binding site at the interface of the symmetrical fold [22]. 
ACPS and other enzymes transfering the P-pant group onto domains of the fatty acid 

25 synthases are usually smaller, about -120 residues, and belong to the second subgroup of 
the P-pant transferase superfamily. The sequence homology between these two subgroups 
is rather low, about 12-22% between E. coli ACPS and B. subtilis Sfp, for example, 
although both have been shown to possess P-pant transferase activity. Alignment [19] of 
some of these proteins show that two regions, residues 5-13 and 54-65 (E. coli ACPS 

30 numbering), are highly conserved with five of the residues in these regions identical. 
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While numerous members of the phosphopantetheinyl transferase superfamily 
have been identified and sequenced, until the present invention, the crystal structure of 
ACPS complexed with halo-ACP, and the three dimensional structure of the ACPS/ACP 
active site has not been determined. Further, prior to the present invention, the solution 
5 structure of B. subtilis ACP had not been determined. 

SUMMARY OF THE INVENTION 
The present invention relates to a crystallized complex comprising an acyl carrier 
protein synthase (ACPS) and an acyl carrier protein (ACP) (hereinafter referred to as 

10 "ACPS-ACP complex"). The invention is further directed to the three dimensional 

structure of the ACPS-ACP complex, as determined using crystallographic analysis (with 
or without sedimentation analysis) of the ACPS-ACP complex. Particularly, the invention 
is directed to the three dimensional structure of the ACP binding site present in ACPS and 
other ACPS-like P-pant transferases, alone, and as complexed with ACP or other agents 

15 that interact with the ACP binding site of said transferases. In addition, the invention is 
directed to the ACPS binding site on ACP. Identification of the three dimensional 
structure of the ACP binding site on ACPS and the ACPS binding site on ACP will be 
valuable for the design of antibiotics and other agents that interfere with P-pant 
attachment, thereby preventing activation of corresponding carrier proteins. 

20 The invention additionally provides a method for identifying an agent that 

interacts with any active site of an ACPS-ACP complex, comprising the steps of 
determining a putative active site of an ACPS-ACP complex from a three dimensional 
model of the ACPS-ACP complex, and performing various computer fitting analyses to 
identify an agent which interacts with the putative active site. Again, such agents may act 

25 as inhibitors or activators of ACPS-ACP complex activity, as determined by obtaining the 
identified agent, contacting the same with ACPS-ACP complex, and measuring the 
agent's effect on ACPS-ACP complex activity. 

In addition, the invention provides a solution comprising B. subtilis ACP having a 
three dimensional structure defined by the structural coordinates of Figures 5 and 5A-1 to 

30 5A-1 5, ± a root mean square deviation from the backbone atoms of said amino acids of 
not more than 1.5 A. Also provided by the invention is any active site of B. subtilis ACP 



that is defined by the structural coordinates of Figures 5 and 5A-1 to 5A-15, ± a root 
mean square deviation from the backbone atoms of said amino acids of not more than 
1 .5A. Further, the present invention provides a method for identifying an agent that 
interacts with any active site of B. subtilis ACP, comprising the steps of determining a 
5 putative active site of ACP from a three dimensional model of the ACP, and performing 
various computer fitting analyses to identify an agent which interacts with the putative 
active site. Again, such agents may act as inhibitors or activators of ACP activity, as 
determined by obtaining the identified agent, contacting the same with ACP, and 
measuring the agent's effect on ACP activity. 

10 Yet another aspect of the present invention is a method for identifying an activator 

or inhibitor of any molecule or molecular complex which comprises an ACP binding site, 
including any member of the ACPS-like P-pant transferases, comprising the steps of 
generating a three dimensional model of said molecule or molecular complex using the 
relative structural coordinates according to Figures 3 and 3A-1 to 3A-79 of residues 

15 Argl4, Metl8, Arg21, Gln22, Arg24, Phe25, Arg28, Phe54, Glu58, Ile68, Gly69, Ala70, 
Ser73 and Phe74 from a first monomer of ACPS (SEQ ID NO:2), and residue Arg45 from 
a second monomer of ACPS (SEQ ID NO:2), or additionally, of residues Asp8, Ile9, 
ThrlO, Glull, Leul2, Ilel5, Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, 
Ala51, Lys57, Ser61, Lys62, Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77 and 

20 Lys93 from the first monomer of ACPS (SEQ ID NO:2) and residues Leu41, Ser42, 

Lys44, Glu48, Gln83, Asn84, HislOS, Thrl06 and Alal07 from the second monomer of 
ACPS (SEQ ID NO:2), in each case ± a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1 .5A, and then selecting or designing a 
candidate activator or inhibitor that interacts with said molecule or molecular complex 

25 using computer fitting analyses of interactions between the three dimensional model of 
the molecule or molecular complex and the candidate activator or inhibitor. The effect of 
the candidate activator or inhibitor may be evaluated by obtaining the candidate activator 
or inhibitor, contacting the same with the molecule or molecular complex, and measuring 
the effect of the candidate activator or inhibitor on molecular or molecular complex 

30 activity. 
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In addition, the present invention provides a method for identifying an activator or 
inhibitor of any molecule or molecular complex which comprises an ACPS binding site, 
comprising the steps of generating a three dimensional model of said molecule or 
molecular complex comprising an ACPS binding site using the relative structural 

5 coordinates according to Figures 3 and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 5A-15 of 
residues Argl4, Lys29, Asp35, Ser36, Leu37, Asp38, Val40, Glu41, Val43, Met44, Glu47, 
Asp48, Ile54, Ser55, Asp56, Glu57 and Glu60, or additionally, of residues Aspl3, LeulS, 
Phe28, Glu30, Asp31, Leu32, Gly33, Ala34, Val39, Leu42, Glu45, Leu46, Glu49, Met52, 
Glu53, Asp58, Ala59, and Lys61, according to the sequence set forth in SEQ ID NO:l, in 

10 each case ± a root mean square deviation from the backbone atoms of said amino acids of 
not more than 1 .5A, and then selecting or designing a candidate activator or inhibitor that 
interacts with said molecule or molecular complex using computer fitting analyses of 
interactions between the three dimensional model of the molecule or molecular complex 
and the candidate activator or inhibitor. The effect of the candidate activator or inhibitor 

15 may be evaluated by obtaining the candidate activator or inhibitor, contacting the same 
with the molecule or molecular complex, and measuring the effect of the candidate 
activator or inhibitor on molecular or molecular complex activity. Also provided by the 
present invention are the activators or inhibitors selected or designed using the above- 
noted methods. 

20 Still further, the present invention is directed to a method of determining the three 

dimensional structure of a molecule or molecular complex whose structure is unknown, 
comprising the steps of first obtaining crystals of the molecule or molecular complex 
whose structure is unknown, and then generating X-ray diffraction data from the 
crystallized molecule or molecular complex. The X-ray diffraction data from the 

25 molecule or molecular complex is compared with the known three dimensional structures 
determined from the ACPS-ACP crystals of the present invention, and molecular 
replacement analysis is used to conform the known three dimensional structures to the X- 
ray diffraction data from the crystallized molecule or molecular complex. 

In addition, the present invention provides the ACP active site of an ACPS-like P- 

30 pant transferase, including, but not limited to, an ACPS, comprising the structural 

coordinates according to Figures 3 and 3A-1 to 3A-79 of residues Arg 14, Metl8, Arg21, 



Gln22, Arg24, Phe25, Arg28, Phe54, Glu58, Ile68, Gly69, Ala70, Ser73 and Phe74 from 
a first monomer of ACPS (SEQ ID NO:2), and residue Arg45 from a second monomer of 
ACPS (SEQ ID NO:2), in each case ± a root mean square deviation from the backbone 
atoms of said amino acids of not more than L5 A. In another embodiment, the active site 
may include, in addition to the structural coordinates above, the relative the structural 
coordinates according to Figures 3 and 3A-1 to 3A-79 of residues Asp8, Ile9, ThrlO, 
Glull, Leul2, Ilel5, Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, Ala51, 
Lys57, Ser61, Lys62, Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77 and Lys93 from 
one monomer of ACPS (SEQ ID NO:2) and residues Leu41, Ser42, Lys44, Glu48, Gln83, 
Asn84, Hisl05, Thrl06 and Alal07 from a second monomer of ACPS (SEQ ID NO:2), in 
each case ± a root mean square deviation from the backbone atoms of said amino acids of 
not more than 1.5 A. 

Finally, the present invention provides the ACPS active site of ACP (SEQ ID 
NO:l), comprising the structural coordinates according to Figures 3 and 3A-1 to 3A-79 
or Figures 5 and 5A-1 to 5A-15 of residues Argl4, Lys29, Asp35, Ser36, Leu37, Asp38, 
Val40, Glu41, Val43, Met44, Glu47, Asp48, Ile54, Ser55, Asp56, Glu57 and Glu60, in 
each case ± a root mean square deviation from the backbone atoms of said amino acids of 
not more than 1.5 A. In another embodiment, the active site may include, in addition to 
the structural coordinates above, the relative structural coordinates according to Figures 3 
and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 5A-15 of residues Aspl3, Leul5, Phe28, 
Glu30, Asp31, Leu32, Gly33, Ala34, Val39, Leu42, Glu45, Leu46, Glu49, Met52, Glu53, 
Asp58, Ala59, and Lys61 according to the sequence set forth in SEQ ID NO:l, ± a root 
mean square deviation from the backbone atoms of said amino acids of not more than 1 .5 
A. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 depicts the amino acid sequences for the forms of ACP (SEQ ID NO:l) 
and ACPS (SEQ ID NO:2) used in the growth of ACP/ACPS complex crystals. 

Figure 2 illustrates the alignment of amino acid sequences for twelve members of 
the ACPS family, including the consensus sequence. Depicted are amino acid sequences 
for Aquifex (SEQ ID NO:3), Chlamydophila (SEQ ID NO:4), Helicobacter (SEQ ID 



N0:5), Staphylococcus (SEQ ID NO:6) s Thermotoga (SEQ ID NO:7), Escherichia (SEQ 
ID NO:8), Rickettsia (SEQ ID NO:9) 3 Streptomyces (SEQ ID NO: 10), Treponema (SEQ 
ID NO: 11), Bacillus (SEQ ID NO: 12), Bradyrhizobium (SEQ ID NO: 13), and 
Mycobacterium (SEQ ID NO: 14). 

5 Figures 3 and 3 A-l to 3 A-79 provide the atomic structural coordinates for ACPS 

and ACP as derived by X-ray diffraction of an ACPS-ACP crystal. "Atom type" refers to 
the atom whose coordinates are being measured. "Residue" refers to the type of residue of 
which each measured atom is a part-i.e., amino acid, cofactor, ligand or solvent. The "x, 
y and z" coordinates indicate the Cartesian coordinates of each measured atom's location 

10 in the unit cell (A). "Occ" indicates the occupancy factor. "B" indicates the "B-value", 
which is a measure of how mobile the atom is in the atomic structure (A 2 ). "MOL" 
indicates the segment identification used to uniquely identify each molecule. Under 
"MOL", "Al", "Bl" and "CI" refers to each molecule of ACPS (SEQ ID NO:2) 5 "API", 
"AP2" and "AP3" refers to each molecule of ACP (SEQ ID NO:l), and "W" refers to 

15 water molecules. 

Figure 4 represents the sequence alignment of B. subtilis ACP (SEQ ID NO:l), E. 
coli ACP (SEQ ID NO: 15), and Streptomyces coelicolor A3(2) ACP (SEQ ID NO: 16). 

Figures 5 and 5 A-l to 5A-15 provide the atomic structural coordinates for the 
restrained minimized mean structure of B. subtilis ACP (SEQ ID NO:l) as derived by 

20 NMR spectroscopy. "Atom type" refers to the atom whose coordinates are being 

measured. "Residue" refers to the type of residue of which each measured atom is a part- 
i.e., amino acid, cofactor, ligand or solvent. The "x, y and z" coordinates indicate the 
Cartesian coordinates of each measured atom's location (A). The last column indicates 
the temperature factor field, representing the rms deviation of the 22 individual NMR 

25 structures about the restrained minimized mean structure. All non-protein atoms are listed 
as HETATM instead of atoms using PDB conventions. 

DETAILED DESCRIPTION OF THE INVENTION 
As used herein, the following terms and phrases shall have the meanings set forth 

30 below: 
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"ACPS" includes acyl carrier protein synthases as well as "ACPS-like" P-pant 
transferases. Acyl carrier protein synthases produce a holo-fatty acid synthase ACP by 
transferring the P-pant moiety to Ser-36 (or equivalent Serine) of an apo-fatty acid 
synthase ACP in a magnesium dependent reaction. "ACPS-like" P-pant transferases are 
those enzymes having P-pant transferase activity (i.e., that transfer the 4'- 
phosphopantetheinyl moiety of CoA to a conserved serine on the corresponding target 
molecule) which form homodimers and activate the ACP domains or subunits of fatty 
acid synthases, polyketide synthases or other enzymes. 

As used herein, "ACP" is the carrier of fatty acids during fatty acid biosynthesis, 
is responsible for acyl group activation and includes a 4'phosphopantetheine (4'-PP) 
prosthetic group in which the 4 f -PP moiety is attached through a phosphodiester linkage 
to a specific conserved serine residue. "ACP" also includes an active (holo) and inactive 
(apo) form where activation of ACP is mediated by Holo-acyl carier protein synthase 
(ACPS), and is preferably the active (holo) form. 

Unless otherwise indicated, "protein" shall include a protein, protein domain, 
polypeptide or peptide. 

"Structural coordinates" are the Cartesian coordinates corresponding to an atom's 
spatial relationship to other atoms in a molecule or molecular complex. Structural 
coordinates may be obtained using x-ray crystallography techniques or NMR techniques, 
or may be derived using molecular replacement analysis or homology modeling. Various 
software programs allow for the graphical representation of a set of structural coordinates 
to obtain a three dimensional representation of a molecule or molecular complex. The 
structural coordinates of the present invention may be modified from the original sets 
provided in Figures 3 and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 5A-15 by 
mathematical manipulation, such as by inversion or integer additions or subtractions. As 
such, it is recognized that the structural coordinates of the present invention are relative, 
and are in no way specifically limited by the actual x, y, z coordinates of Figures 3 and 
3A-1 to 3A-79 and Figures 5 and 5A-1 to 5A-15. 

An "agent" shall include a protein, polypeptide, peptide, nucleic acid, including 
DNA or RNA, molecule, compound, antibiotic or drug. 
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"Root mean square deviation" is the square root of the arithmetic mean of the 
squares of the deviations from the mean, and is a way of expressing deviation or variation 
from the structural coordinates of ACPS and ACP described herein. The present invention 
includes all embodiments comprising conservative substitutions of the note amino acid 
5 residues resulting in same structural coordinates within the stated root mean square 
deviation. 

It will be obvious to the skilled practitioner that the numbering of the amino acid 
residues in the various isoforms of ACPS, other ACPS-like P-pant transferases and ACP 
may be different than that set forth herein or may contain certain conservative amino acid 

10 substitutions that yield the same three dimensional structures as those defined in Figures 
3 and 3A-1 to 3A-79 and Figures 5 and 5A-1 to 5A-15. Corresponding amino acids and 
conservative substitutions in other isoforms or analogues are easily identified by visual 
inspection of the relevant amino acid sequences or by using commercially available 
homology software programs (e.g., MODELLAR, MSI, San Diego, Calif). 

15 "Conservative substitutions" are those amino acid substitutions which are 

functionally equivalent to the substituted amino acid residue, either by way of having 
similar polarity, steric arrangement, or by belonging to the same class as the substituted 
residue (e.g., hydrophobic, acidic or basic) and includes substitutions having an 
inconsequential effect on the three dimensional structure of the ACPS-ACP complex, and 

20 the solution structure of B. subtilis ACP, with respect to the use of said structures for the 
identification and design of agents which interact with ACPS and ACP, for molecular 
replacement analyses and/or for homology modeling. 

As used herein, an "active site" refers to a region of a molecule or molecular 
complex that, as a result of its shape and charge potential, favorably interacts or 

25 associates with another agent (including, without limitation, a protein, polypeptide, 

peptide, nucleic acid, including DNA or RNA, molecule, compound, antibiotic or drug) 
via various covalent and/or non-covalent binding forces. 

As such, the active site of the ACPS-ACP complex may include both the actual 
site of ACP binding with ACPS, as well as accessory binding sites adjacent or proximal 

30 to the actual site of ACP binding that nonetheless may affect ACPS, ACPS-ACP or ACP 
activity upon interaction or association with a particular agent, either by direct 



interference with the actual site of ACP binding or by indirectly affecting the steric 
conformation or charge potential of the ACPS molecule and thereby preventing or 
reducing ACP binding to ACPS at the actual site of ACP binding. As used herein, an 
active site of ACPS-ACP also includes ACPS or ACPS analog residues which exhibit 
observable NMR perturbations in the presence of a binding ligand, such as ACP. While 
such residues exhibiting observable NMR perturbations may not necessarily be in direct 
contact with or immediately proximate to ligand binding residues, they may be critical to 
ACPS residues for rational drug design protocols. 

The active site of ACP includes a region of ACP that, as a result of its shape and 
charge potential, favorably interacts or associates with another agent (including, without 
limitation, a protein, polypeptide, peptide, nucleic acid, including DNA or RNA, 
molecule, compound, antibiotic or drug) via various covalent and/or non-covalent 
binding forces. Preferably, the active site on ACP is the site of interaction with ACPS. 

The present invention is directed to a crystallized ACPS-ACP complex that 
effectively diffracts X-rays for the determination of the structural coordinates of the 
ACPS-ACP complex. As used herein, the proteins used in the ACPS-ACP crystal 
complex of the present invention includes any ACPS or ACP protein (i.e., as used herein, 
any protein, polypeptide or peptide), isolated from any source (including, but not limited 
to, a protein isolated from Aquifex, Chlamydophila, Helicobacter, Staphylococcus, 
Thermotoga, Escherichia, Rickettsia, Streptomyces, Treponema, Bacillus, 
Bradyrhizobium, and Mycobacterium). In a preferred embodiment of the invention, 
ACPS and ACP are both cloned and isolated from B. subtilis, and overexpressed in a 
commercially available E. coli system. 

The ACPS protein in the ACPS/ACP complex includes ACPS as well as proteins 
having ACPS-like P-pant transferase activity, including the consensus sequence shown in 
Figure 2. More preferably, the ACPS protein or proteins having ACPS-like P-pant 
transferase activity, comprises the relative structural coordinates according to Figures 3 
and 3A-1 to 3A-79 for the residues Argl4, Metl8, Arg21, Gln22, Arg24, Phe25, Arg28, 
Arg45, Phe54, Glu58, Ile68, Gly69, Ala70, Ser73 and Phe74, or conservative 
substitutions thereof, and additionally, the residues Asp8, Ile9, ThrlO, Glull, Leul2, 
IlelS, Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, Leu41, Ser42, Ala44, 

10 



Glu48, Ala51, Lys57, Ser61 , Lys62, Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77, 
Gln83, Asn84, Lys93, Hisl05, Thrl06 and Alal07, or conservative substitutions thereof. 
The amino acid residues correspond to the sequence set forth in SEQ ID NO:2. More 
particularly, the ACPS protein or proteins having ACPS-like P-pant transferase activity 
5 include an ACP binding site defined using the relative structural coordinates according to 
Figures 3 and 3A-1 to 3A-79 of residues Argl 4, Metl8, Arg21, Gln22, Arg24, Phe25, 
Arg28, Phe54, Glu58 5 Ile68, Gly69, Ala70, Ser73 and Phe74 from a first monomer of 
ACPS (SEQ ID NO:2), and residue Arg45 from a second monomer of ACPS (SEQ ID 
NO:2), or additionally including the relative structural coordinates of residues Asp8, Ile9, 

10 ThrlO, Glull, Leul2, Ilel5, Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, 
Ala51, Lys57, Ser61, Lys62, Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77 and 
Lys93 from the first monomer of ACPS (SEQ ID NO:2) and residues Leu 41, Ser42, 
Ala44, Glu48, Gln83, Asn84, Hisl05, Thrl06 and Alal07 from the second monomer of 
ACPS (SEQ ID NO:2). In each case, the ± a root mean square deviation from the 

15 backbone atoms of the amino acids is not more than 1 .5 A, more preferably not more than 
1 .0 A, and most preferably, not more than 0.5 A. 

The ACP protein in the ACPS/ACP complex includes ACP and proteins having 
ACP activity, and preferably comprises the relative structural coordinates accordingly to 
Figures 3 and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 5A-15 for the residues Argl4, 

20 Lys29, Asp35, Ser36, Leu37, Asp38, Val40, Glu41, Val43, Met44, Glu47, Asp48, Ile54, 
Ser55, Asp56, Glu57 and Glu60, or conservative substitutions thereof, and additionally, 
the residues Aspl3, Leul5, Phe28, Lys29, Glu30, Asp31, Leu32, Gly33, Ala34, Asp35, 
Ser36, Leu37, Asp38, Val39, Leu42, Glu45, Leu46, Glu49, Met52, Glu53, Asp58, Ala59 
and Lys61, or conservative substitutions thereof, in each case ± a root mean square 

25 deviation from the backbone atoms of said amino acids of not more than 1 .5 A, or more 
preferably not more than 1 .0 A, or most preferably, not more than 0.5 A. The amino acid 
residues correspond to the sequence set forth in SEQ ID NO:l. 

The crystals of the present invention may take a wide variety of forms, all of 
which are included in the present invention. However, in a preferred embodiment of the 

30 present invention, the ACPS-ACP crystallized complex is characterized as being in rod- 
shape form with space group C222i, and having unit cell parameters of a=78.46 A, 



b=l 22.03 A and c=l 36.77 A, and consists of three molecules of ACPS and three 
molecules of ACP in an asymmetric unit. 

Once a crystal or crystal complex of the present invention is grown, X-ray 
diffraction data can be collected by a variety of means in order to obtain the atomic 
coordinates of the crystallized molecule or molecular complex. With the aid of 
specifically designed computer software, such crystallographic data can be used to 
generate a three dimensional structure of the molecule or molecular complex. Various 
methods used to generate and refine the three dimensional structure of a crystallized 
molecule or molecular structure are well known to those skilled in the art, and include, 
without limitation, multiwavelength anomalous dispersion (MAD), multiple isomorphous 
replacement, reciprocal space solvent flattening, molecular replacement, and single 
isomorphous replacement with anomalous scattering (SIRAS). 

The present invention is also directed to an ACP active site of an ACPS-like P- 
pant transferase, including the active site of ACPS, and comprising the structural 
coordinates according to Figures 3 and 3A-1 to 3A-79 of residues Argl4, Metl 8, Arg21, 
Gln22, Arg24, Phe25, Arg28, Phe54, Glu58, Ile68, Gly69, Ala70, Ser73 and Phe74 from 
one monomer of ACPS (SEQ ID NO:2), and residue Arg45 from a second monomer of 
ACPS (SEQ ID NO:2), in each case ± a root mean square deviation from the backbone 
atoms of said amino acids of not more than 1 .5 A, or more preferably not more than 
1.0 A, or most preferably, not more than 0.5 A. Alternatively, the active site may include, 
in addition to the structural coordinates define above, the structural coordinates according 
to Figures 3 and 3A-1 to 3A-79 of residues Asp8, Ile9, ThrlO, Glull, Leul2, Ilel5, 
Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, Ala51, Lys57, Ser61, Lys62, 
Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77 and Lys93 from the first monomer of 
ACPS (SEQ ID NO:2) and residues Leu41, Ser42, Lys44, Glu48, Gln83, Asn84, Hisl05, 
Thrl06 and Alal07 from the second monomer of ACPS (SEQ ID NO:2), in each case ± a 
root mean square deviation from the backbone atoms of said amino acids of not more 
than 1 .5 A, or more preferably not more than 1 .0 A, or most preferably, not more than 0.5 
A. Preferably, the ACP active site corresponds to the configuration of the ACPS molecule 
in its state of association or inactivation with an agent, and preferably, ACP. 
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In addition, the present invention provides the ACPS active site of an ACP that 
comprises the structural coordinates according to Figures 3 and 3A-1 to 3A-79 or Figures 
5 and 5A-1 to 5A-15 of residues Argl4, Lys29, Asp35, Ser36, Leu37, Asp38, Val40, 
Glu41, Val43, Met44, Ser47, Asp48, Ile54, Ser55, Asp56, Glu57 and Glu60, ± a root 
5 mean square deviation from the backbone atoms of said amino acids of not more than 1 .5 
A, or more preferably not more than 1 .0 A, or most preferably, not more than 0.5 A. The 
amino acid residues are according to the sequence set forth in SEQ ID NO: 1 . 
Alternatively, the active site further includes, in addition to the coordinates defined 
above, the structural coordinates according to Figures 3 and 3A-1 to 3A-79 or Figures 5 

10 and 5A-1 to 5A-15 of residues Asp 13, Leul5, Phe28, Glu30, Asp31, Leu32, Gly33, 

Ala34, Val39, Leu42, Glu45, Leu46, Glu49, Met52, Glu53, Asp58, Ala59, and Lys61, ± a 
root mean square deviation from the backbone atoms of said amino acids of not more 
than 1 .5 A, or more preferably not more than 1 .0 A, or most preferably, not more than 0.5 
A. The amino acid residues are according to the sequence set forth in SEQ ID NO:l . 

15 Preferably, the ACPS active site corresponds to the configuration of the ACP molecule in 
its state of association or inactivation with an agent, and preferably, ACPS. 

Another aspect of the present invention is directed to a method for identifying an 
agent that interacts with an active site of an ACPS-ACP complex, comprising the steps of 
determining an active site of the ACPS-ACP complex from a three dimensional model of 

20 the ACPS-ACP complex and performing computer fitting analyses to identify an agent 
which interacts with said active site. Computer fitting analyses utilize various computer 
software programs that evaluate the "fit" between the putative active site and the 
identified agent, by (a) generating a three dimensional model of the putative active site of 
a molecule or molecular complex using homology modeling or the atomic structural 

25 coordinates of the active site, and (b) determining the degree of association between the 
putative active site and the identified agent. Three dimensional models of the putative 
active site may be generated using any one of a number of methods known in the art, and 
include, but are not limited to, homology modeling as well as computer analysis of raw 
data generated using crystallographic or spectroscopy data. Computer programs used to 

30 generate such three dimensional models and/or perform the necessary fitting analyses 
include, but are not limited to: GRID (Oxford University, Oxford, UK), MCSS 



(Molecular Simulations, San Diego, Calif.), AUTODOCK (Scripps Research Institute, La 
Jolla, Calif.), DOCK (University of California, San Francisco, Calif), Flo99 (Thistlesoft, 
Morris Township, N.J.), Ludi (Molecular Simulations, San Diego, Calif), QUANTA 
(Molecular Simulations, San Diego, Calif), Insight (Molecular Simulations, San Diego, 
Calif), SYBYL (TRIPOS, Inc., St. Louis. Mo.) and LEAPFROG (TRIPOS, Inc., St. 
Louis, Mo.). 

The effect of such an agent identified by computer fitting analyses on ACPS-ACP 
complex activity may be further evaluated by contacting the identified agent with the 
ACPS-ACP complex and measuring the effect of the agent on ACPS-ACP complex 
activity. Depending upon the action of the agent on the active site of ACPS-ACP 
complex, the agent may act either as an inhibitor or activator of ACPS-ACP complex 
activity. Enzymatic assays may be performed and the results analyzed to determine 
whether the agent is an inhibitor of ACPS-ACP complex activity (i.e., the agent may 
reduce or prevent binding affinity between ACPS and ACP, and thereby reduce the level 
or rate of ACPS-ACP activity compared to baseline), or an activator of ACPS-ACP 
activity (i.e., the agent may increase binding affinity between ACPS and ACP, and 
thereby increase the level or rate of ACPS activity compared to baseline). Further tests 
may be performed to evaluate the effect of the identified agent on bacterial or eukaryotic 
cell populations, wherein an inhibitor of ACPS-ACP activity inhibits cell viability or 
reproduction. 

The present invention is not limited to identifying agents which interact with an 
active site of the ACPS-ACP complex, but also is directed to a method for identifying an 
activator or inhibitor of any molecule or molecular complex comprising an ACP binding 
site or an ACPS binding site. The candidate activator or inhibitor is selected or designed 
by performing computer fitting analyses of said candidate agent with the three 
dimensional model of the molecule or molecular complex comprising the active site. 
Once the candidate activator or inhibitor is obtained, it may be contacted with the 
molecule or molecular complex in order to measure the effect the candidate activator or 
inhibitor has on said molecule or molecular complex. 

In this regard, a potential activator or inhibitor of a molecule or molecular 
complex comprising an ACP binding site, is obtained by (a) generating a three 
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dimensional model of said molecule or molecular complex comprising an ACP binding 
site using the relative structural coordinates according to Figures 3 and 3A-1 to 3A-79 of 
residues Argl4, Metl8, Arg21, Gln22, Arg24, Phe25, Arg28, Phe54, Glu58, Ile68, Gly69, 
Ala70, Ser73 and Phe74 from a first monomer of ACPS (SEQ ID NO:2), and residue 
Arg45 from a second monomer of ACPS (SEQ ID NO:2), and (b) selecting or designing a 
candidate activator or inhibitor by performing computer fitting analysis of the candidate 
activator or inhibitor with the three dimensional model generated in step (a). In another 
embodiment, the relative structural coordinates further include the relative structural 
coordinates according to Figures 3 and 3A-1 to 3A-79 of residues Asp8, Ile9, ThrlO, 
Glull, Leul2, Ilel5, Alal6, Serl7, Alal9, Gly20, Ala23, Ala26, Glu27, Ile29, Ala51, 
Lys57, Ser61, Lys62, Thr66, Gly67, Gln71, Leu72, Gln75, Asp76, Ile77 and Lys93 from 
said first monomer of ACPS (SEQ ID NO:2) and residues Leu41, Ser42, Lys44, Glu48, 
Gln83, Asn84, Hisl05, Thrl06 and Alal07 from said second monomer of ACPS (SEQ ID 
NO:2). In each case, the ± a root mean square deviation from the backbone atoms of the 
amino acids is not more than 1 .5 A, preferably not more than 1 .0 A, and most preferably 
is not more than 0.5 A. 

A potential activator or inhibitor of a molecule or molecular complex comprising 
an ACPS binding site, may be obtained by (a) generating a three dimensional model of 
said molecule or molecular complex comprising an ACPS binding site using the relative 
structural coordinates according to Figures 3 and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 
5A-15 of residues Argl 4, Lys29, Asp35, Ser36, Leu37, Asp38, Val40, Glu41, Val43, 
Met44, Glu47, Asp48, Ile54, Ser55, Asp56, Glu57 and Glu60, according to the sequence 
set forth in SEQ ID NO: 1 , and (b) selecting or designing a candidate activator or inhibitor 
by performing computer fitting analysis of the candidate activator or inhibitor with the 
three dimensional model generated in step (a). In another embodiment, the relative 
structural coordinates further include the relative structural coordinates according to 
Figures 3 and 3A-1 to 3A-79 or Figures 5 and 5A-1 to 5A-15 of residues Aspl3, Leul5, 
Phe28, Glu30, Asp31, Leu32 5 Gly33, Ala34, Val39, Leu42, Glu45, Leu46, Glu49, Met52, 
Glu53, Asp58, Ala59, and Lys61 according to the sequence set forth in SEQ ID NO: 1 . In 
each case, the ± a root mean square deviation from the backbone atoms of the amino 
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acids is not more than 1 .5 A, preferably not more than 1 .0 A, and most preferably is not 
more than 0.5 A. 

In addition, the invention provides a solution comprising B. subtilis ACP having a 
three dimensional structure defined by the structural coordinates of Figures 5 and 5A-1 to 
5A-15, ± a root mean square deviation from the backbone atoms of said amino acids of 
not more than 1 .5 A, more preferably not more than 1 .0 A, and most preferably not more 
than 0.5 A. Also provided by the invention is any active site of B. subtilis ACP that is 
defined by the structural coordinates of Figures 5 and 5A-1 to 5A-15, ± a root mean 
square deviation from the backbone atoms of said amino acids of not more than 1 .5 A, 
more preferably not more than 1 .0 A, and most preferably not more than 0.5 A. In 
addition, the invention provides a method for identifying an agent that interacts with any 
active site of B. subtilis ACP, comprising the steps of determining a putative active site of 
ACP from a three dimensional model of the ACP, and performing various computer 
fitting analyses to identify an agent which interacts with the putative active site. Again, 
such agents may act as inhibitors or activators of ACP activity, as determined by 
obtaining the identified agent, contacting the same with ACP, and measuring the agent's 
effect on ACP activity. In the preferred embodiment, the three dimensional structure of B. 
subtilis ACP is defined by the relative structural coordinates of Figures 5 and 5A-1 to 5A- 
15, ± a root mean square deviation from the backbone atoms of said amino acids of not 
more than 1.5 A, or more preferably not more than 1 .0 A, or most preferably, not more 
than 0.5 A. The use of the NMR solution structure of ACP for the identification of 
inhibitor binding sites on ACP, for the determination of the solution structure of ACP- 
inhibitor complexes, and for inhibitor design, is described further below in Examples 3-5. 

Various molecular analysis and rational drug design techniques are further 
disclosed in U.S. Pat. Nos. 5,834,228, 5,939,528 and 5,865,116, as well as in PCT 
Application No. PCT/US98/16879, published WO 99/09148, the contents of which are 
hereby incorporated by reference. 

The present invention is also directed to the agents, activators or inhibitors 
identified using the foregoing methods. Such agents, activators or inhibitors may be a 
protein, polypeptide, peptide, nucleic acid, including DNA or RNA, molecule, 
compound, antibiotic or drug. 
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Finally, the present invention is further directed to a method for determining the 
three dimensional structure of a molecule or molecular complex whose structure is 
unknown, comprising the steps of obtaining crystals of the molecule or molecular 
complex whose structure is unknown and generating X-ray diffraction data from the 
crystallized molecule or molecular complex. The X-ray diffraction data from the 
molecule or molecular complex is then compared with the known three dimensional 
structure determined from the ACPS-ACP crystals of the present invention. Then, the 
known three dimensional structure determined from the crystals of the present invention 
is "conformed" using molecular replacement analysis to the X-ray diffraction data from 
the crystallized molecule or molecular complex. Alternatively, spectroscopic data or 
homology modeling may be used to generate a putative three dimensional structure for 
the molecule or molecular complex, and the putative structure is refined by conformation 
to the known three dimensional structure determined from the ACPS-ACP crystals of the 
present invention. 

The present invention may be better understood by reference to the following 
non-limiting Examples. The following Examples are presented in order to more fully 
illustrate the preferred embodiments of the invention, and should in no way be construed 
as limiting the scope of the present invention. 

Example 1 
Crystal Structure ofACPS/ACP Complex 
L Material and Methods 

Crystallization ofACPS with ACE Purified ACP and ACPS were mixed at a 1 : 1 . 1 
molar ratio and the mixture was loaded onto a gel filtration column. The resulting 
purified complex was dialyzed against a solution containing 50 mM Bis-Tris pH 6.4, 100 
mM NaCl, 10 mM MgCl 2 , and 10 mM DTT before concentrating the complex to -10 
mg/mL. Crystallization conditions for the ACP/ACPS complex were also determined 
using the sparse matrix screens available from both Hampton Research and Emerald 
Biostructures. Screens were set up with 2 \iL drops at both 18° C. and 4° C. Optimization 
of a crystalline hit (0.2M Potassium Formate, 20% PEG 3350) gave diffraction quality 
rod shaped crystals. Crystals could be obtained between 0.1 5M and 0.3M Potassium 
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Formate and between 15 and 23% PEG 3350. The rate of growth seriously affected the 
quality of crystals with the optimal crystals being formed between 8 and 12 days after 
setup. The crystals were grown with a 1 :2 drop ratio of protein to well solution at 1 8° C. 
These crystals belonged to space group C222i with unit cell parameters a=78.46, 
5 b=122.03, c=l 36.77 A and contained three molecules of ACPS and three molecules of 
ACP in the asymmetric unit. 

Data Collection. Data from the ACP/ACPS complex crystals were collected using a It- 
Axis IV mounted on a Rigaku RUH2R rotating anode operating at 5 kW from a single 
10 crystal, cooled to -1 80° C. The data to 2.3 A were collected for the ACP/ACPS complex 
crystal using one-degree oscillations. The data were processed using DENZO and 
Scalepack [23] and the statistics from refinement are given in Table 2. 

Model Building and Refinement. The structure of the ACP/ACPS complex was solved by 

15 molecular replacement using the program AMORE[24], with the trimer of ACPS as 

found in the ACPS/CoA structure as the probe. Prior to refinement, 10% of the data were 
randomly selected and designated as a R free test set to monitor the progress of the 
refinement. The structure of the ACPS trimer was then rebuilt using the X-BUILD feature 
in Quanta utilizing a series of omit maps. During this rebuilding, extra density was found 

20 in each active site that sharpened after each cycle of rebuilding. When the ACPS 

molecules had been rebuilt, the NMR structure of the B. subtilis ACP was rotated into the 
density found in the active site. As a result of a large domain shift, a consequence of 
binding to ACPS, there were enough differences between the NMR structure and the X- 
Ray data that precluded using the NMR model as the starting point for refinement. 

25 Instead, the location of methionine 44 was noted from the NMR structure and the 

remainder of the ACP molecule was built into density using omit maps from that residue. 
Reference to the NMR structure as a source of secondary structure information allowed 
the structure of the three ACP molecules to be built into the electron density rapidly. 

When roughly 80% of the ACP had been built into density, that ACP/ACPS model 

30 was then used as the initial model for refinement using the program CNS [25]. Following 
six cycles of refining and rebuilding the refinement converged with a model which 
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contained 3 molecules of ACPS, 3 molecules of ACPS and 117 solvent molecules at an 
Rcryst of 22.9% and R free of 28.0%. The refinement statistics are given in Table 3. 

2. Results and Discussion 
5 Needle-like crystals (0. 1 .times.0. 1 .times.0.5 mm) were grown using the hanging 

drop method from an equal molar solution of ACP and ACPS. These crystals belong to 
space group C222j with unit cell parameters a=78.46, b=122.03, c=136.77 A and diffract 
to 2.3 A using a R-Axis IV mounted on a Rigaku RUH2R rotating anode operating at 
5 kW. These cell dimensions correspond to the asymmetric unit containing 3 molecules of 

1 o ACPS and 3 molecules of ACP. The sequence of the ACP (SEQ ID NO: 1 ) and ACPS 
(SEQ ID NO:2) used in obtaining these crystals is shown in Figure 1 . The 
phosphopantetheinyl group that is attached to the O y of Ser-36 from ACP (SEQ ID NO: 1 ) 
is not indicated in this figure. 

The contacts between holo-ACP and ACPS are predominately hydrophilic in 

15 nature with almost all of the interactions occurring between helix a\ of ACPS and helix 
q2 of ACR There are only two significant hydrophobic contacts and these both involve 
residues (Leu-37 and Met-44) from ACP (SEQ ID NO: 1) protruding into hydrophobic 
pockets on ACPS. Leu-37 extends into a pocket formed by Met-18, Phe-25, Phe-54 and 
He- 15 on ACP (SEQ ID NO:l) while Met-44 binds in a pocket formed by Phe-25 and the 

20 aliphatic portion of the side chains from Arg-28 and Gln-22. Table 1 details the 
hydrophilic interactions. 

Examination of this structure suggests that a key residue in the binding of ACP to 
ACPS is Arg-14 from ACPS (SEQ ID NO:2). Arg-14 forms a salt bridge with the residue 
just before the reactive serine (Asp-35) of ACP (SEQ ID NO:l) and is involved in 

25 hydrogen bonding with Asp-38, two residues after the reactive serine. These interactions 
serve to position the ACP (SEQ ID NO: 1) molecule so that one end of helix o2 from ACP 
(SEQ ID NO:l) is placed at the bottom of the active site and correctly orients Ser-36. As 
shown in Figure 2, Arg-14 is conserved in all ACPS proteins except that from 
Mycobacterium in which it is an aspartic acid (SEQ ID NO: 14). Another arginine, 

30 Arg-21, of ACPS (SEQ ID NO:2) forms a salt bridge with Glu-41 from ACP (SEQ ID 
NO:l). The other end of helix o2 is positioned by the interactions of Arg-24 and Gln-22 
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from ACPS (SEQ ID NO:2) with Asp-48 of ACP (SEQ ID NO:l). These interactions, 

along with the two hydrophobic "keys" described above, lock the ACP into place. 

When the structure of ACPS/CoA and ACPS/ACP are superimposed (not shown), 

a loop consisting of residues 64 to 78 enlarges the active site by shifting 2 A to 
5 accommodate helix o3 from ACP. Additionally, the dipole of the oil helix of ACP is 

directed at the phosphate of the CoA that is to be transferred to ACP. 

Since there is a significant rearrangement of ACP upon binding to ACPS and there 

are only limited interactions between the ACP and ACPS, it is not surprising that the B- 

values indicate the ACP molecules are very mobile. The average B of the main chain 
10 atoms of the three ACPS molecules is 40.1 A 3 while that of the three ACP molecules is 

72.3 A 3 . 



Table 1. Hydrophilic Interactions between ACP and ACPS 

ACP Residue ACPS Residue Distance (A) 

15 (SEQ ID NO: 1 ) (SEQ ID NO:2) 

Hydophilic 



9 


Lys 


NZ 


308 Glu 


OE1 


3.15* 


35 


Asp 


OD1 


214Arg 


NH2 


2.81 


35 


Asp 


OD2 


214Arg 


NH1 


2.50 


35 


Asp 


OD2 


214 Arg 


NH2 


3.05 


38 


Asp 


OD1 


214 Arg 


NH2 


3.05 


38 


Asp 


OD2 


214 Arg 


NH2 


3.20 


41 


Glu 


GE1 


221 Arg 


NH2 


2.66 


41 


Glu 


OE2 


221 Arg 


NE 


2.91 


48 


Asp 


OD1 


224 Arg 


NH1 


2.95 


54 


lie 


0 


228 Arg 


NH2 


2.60 


56 


Asp 


OD2 


273 Ser 


OG 


2.50 


56 


Asp 


OD2 


274 Phe 


N 


2.85 


60 


Glu 


OE2 


270 Ala 


N 


2.97 



30 

♦denotes a symmetry related molecule 
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Table 2. Residues of ACPS (SEQ ID NO:2) from Figure 1 that were modeled as alanine 
due to poor electron density beyond CB for ACPS long chain residue 





Amino Acid 


Chain A 


Chain B 


Chain C 


5 


Lysl3 


Ala 


A 1 

Ala 


T _ 

Lys 




Arg21 


Ala 


Arg 


Ala 




Arg32 


Ala 


Arg 


Ala 




Glu40 


Ala 


Ala 


Glu 




Glu43 


Glu 


Ala 


Ala 


10 


Arg45 


Arg 


Ala 


Arg 




Arg70 


Arg 


Ala 


Ala 




Lys81 


Lys 


Ala 


Lys 




Gln83 


Ala 


Gin 


Gin 




Lys86 


Lys 


Ala 


Lys 


15 


Gln96 


Gin 


Gin 


Ala 




Lysl07 


Lys 


Ala 


Ala 
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Table 3. Statistics for Data Collection, Phase Determination, and Refinement 



Data Collection 


a r*T)j a roc 
A Lr/A Cro 


Wavelength (A) 


X=1.54 


resolution range (A) 


15-2.3 


p a 
Emerge 


5.7%(56.0) 


% complete 


97.7(94.5) 


total reflections 


270151 


unique reflections 


29694 


1/(7 (I) 


26.6(3.5) 



15 



20 



25 



Model Refinement 
Maximum Resolution (A) 



Rwork 
Rfree(°/(, 

<B value> (A 3 ) 
R.m.s. Deviations from ideal geometry for 

Bonds (A) 

Angles (°) 

B values (A 2 ) 
Non-hydrogen Protein Atoms 
Water Molecules 
Ions 

Other Molecules 



ACP/ACPS 

2.3 
22.9 
28.0 
52.3 

0.0156 

1.80 

2.054 

6972 

117 

none 

none 



<I h > !/2/Ih, where <Ih> is the average intensity over symmetry equivalents. 
Number in parentheses reflect statistics for the last shell 



Rmerge 
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b Rwork = ViViYo^A - ViY^Vil ViFobs 1 /^ Rfree is equivalent to R WO rk, but calculated for a 
randomly chosen 5% (or 10%) of reflections omitted from the refinement process. 
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Table 4. 

Residues from ACPS (SEQ ID NO:2) which are within 4A of ACP. 

From one molecule of ACPS (SEQ ID NO: 2): 

5 Arg-14, Met-1 8, Arg-21 , Gln-22, Arg-24, Phe-25, Arg-28, Phe-54, Glu-58, Ile-68, Gly- 
69, Ala-70, Ser-73, Phe-74 

From a Second Molecule of ACPS (SEQ ID NO: 2) 
Arg-45 

10 

Residues from ACP (SEQ ID NO: 1) which are within 4 A of the ACPS dimer. 

Arg-14, Lys-29, Asp-35, Ser-36, Leu-37, Asp-38, Val-40, Glu-41, Val-43, Met-44, Glu- 
47, Asp-48, Ile-54, Ser-55, Asp-56, Glu-57, Glu-60 

15 

Table 5. 

Residues from ACPS (SEQ ID NO:2) which are within 8A of ACP. 

From one molecule of ACPS (SEQ ID NO: 2): 

20 Asp-8, Ile-9,Thr-10, Glu-11, Leu- 12, Arg-14, Ile-15, Ala-16, Ser-17, Met-18 Ala-19, 

Gly-20, Arg-21, Gln-22, Ala-23, Arg-24, Phe-25, Ala-26, Glu-27, Arg-28, Ile-29, Ala-51, 
Phe-54, Lys-57, Glu-58, Ser-61, Lys-62, Thr-66, Gly-67, Ile-68, Gly-69, Ala-70, Gln-71, 
Leu-72, Ser-73, Phe-74, Gln-75, Asp-76, Ile-77, Lys-93 

25 From a Second Molecule of ACPS (SEQ ID NO: 2): 

Leu-41, Ser-42, Lys-44, Arg-45, Glu-48, Gln-83, Asn-84, His-105, Thr-106, Ala-107 

Residues from ACP (SEQ ID NO:l) which are within 8 A of the ACPS dimer. 

Asp-13, Arg-14, Leu-15, Phe-28, Lys-29, Glu-30, Asp-31, Leu-32, Gly-33, Ala-34, Asp- 
30 35, Ser-36, Leu-37, Asp-38, Val-39, Val-40, Glu-41, Leu-42, Val-43, Met-44, Glu-45, 
Leu-46, Glu-47, Asp-48, Glu-49, Met-52, Glu-53, Ile-54, Ser-55, Asp-56, Glu-57, Asp- 
58,Ala-59, Glu-60, Lys-61 
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Example 2 

Determination ofNMR Solution Structure ofACP 
1. Material and Methods 

B. subtilis ACP Sample Preparation. The uniform 15N and 13C-labeled B. subtilis ACP 
5 (SEQ ID NO:l) was cloned into pGEX-6P-l vector and expressed in E. coli strain 

BL21DE3 (pLysS) similar to the conditions previously reported for expression of ACPS 
[26], except that 0.5 mM IPTG was used for induction. Purification was done using the 
following procedure. Typically, 20 grams of cell pellet expressing GST- ACP fusion 
protein was resuspended in 300 ml breaking buffer consisting of 50 mM Tris CI (pH 8.0), 

10 300 mM NaCl, 10 mM MgCl 2 and 2 mM of freshly prepared MnCl 2 . Protease inhibitor 
tablets (Boehringer Mannheim GmbH, Mannheim, Germany), RNase H and DNase I 
(Sigma Chemical Co., St. Louis, Mo.) were added to the solution to prevent protease 
activity and to decrease viscosity of the solution. The cells were lysed by three passages 
through a Microfluidizer and the whole lysate was rocked at room temperature for one 

15 hour to enable the conversion of holo-ACP to apo-ACP by an endogenous ACP hydrolase 
from E. coli [27]. Once the incubation was finished, the lysate was centrifuged at 15,000 
g for 20 minutes at 4° C. to remove the cell debris. Glutathione sepharose 4B resin 
(Amersham Pharmacia Biotech, Piscataway, N.J.) equilibrated with the same breaking 
buffer was added to the clear supernatent to a rough ratio of 1 ml resin slurry per 8 mg of 

20 GST fusion protein. The mixture was incubated at 4°C. for one hour by gentle rocking 
and centrifuged at 3,000 g for 10 minutes to remove excess supernatent prior to packing 
the resin into a suitable column. The column was then washed with 5 column volume of 
washing buffer containing 50 mM Tris CI (pH 8.0), 10 mM MgCl 2 , 5 mM DTT. The 
GST-ACP was eluted with wash buffer plus 60 mM freshly prepared reduced glutathione. 

25 The resulting GST-ACP solution was dialyzed overnight with Prescission Protease 

Cleavage buffer consisting of 50 mM Tris CI (pH 8.0), 150 mM NaCl, 1 mM EDTA and 
1 mM DTT. The fusion protein was cleaved with Prescission Protease (Amersham 
Pharmacia Biotech, Piscataway, N.J.) at room temperature for three hours at a ratio of 1U 
enzyme per 500 ^g protein. The resulting protein mixture was loaded onto a MonoQ 

30 HR1 6/1 0 column (Amersham Pharmacia Biotech) equilibrated with 50 mM Tris CI; 
pH 8.0, 150 mM NaCl and 10 mM MgCl 2 . 



NMR Data Collection. The NMR sample is a mixture of l5 N-, 13 C-double labeled Apo- 
and Holo-ACP in 50 mM Bis-Tris (pH 6.4), 100 mM NaCl, 10 mM MgCl 2 and 10 mM 
DTT with 0.02% NaN 3 in 5% D 2 0/95% H 2 0 solution. The protein concentration was 
5 <\ mM. 

All spectra were recorded at 25° C on Varian Unity + 600 spectrometer equipped 
with triple-resonance 'H/^C/^N probe and an actively shielded z-gradient pulsed field 
accessories. 2D- ! H- 15 N HSQC and all triple-resonance 3D experiments were recorded 
with the enhanced-sensitivity pulsed field gradient approach [28]. This approach provides 

10 coherence transfer selection both to improve sensitivity and eliminate artifacts as well as 
for solvent suppression. 

Data sets were typically processed and displayed on SGI workstation using the 
program packages NMRDraw and NMRPipe [29]. A skewed 600 phase-shifted sine-bell 
function and a single zero-filling was used in each of the all three dimensions prior to 

15 Fourier transformation. For triple-resonance 3D experiments, the time domain was 
extended by a factor of two using forward-backward linear prediction in the l5 N (t2) 
dimension and for constant-time ! H- 13 C correlation experiments, mirror image linear 
prediction was used prior to zero-filling to the double time-domain data points [30]. The 
programs PIPP and STAPP [31] were used for data analysis and semi-automatic 

20 assignments [30]. 

The complete assignments (>95% ) of the ! H, l5 N and 13 C resonances were based 
on the following experiments: CBCA(CO)NNH, HNCACB, C(CC)TOCSY_NNH, 
H(CC)TOCSY_NNH, HAHB(CBCACO)NNH [28,32]. 2D ,3 C (methyl)-lH HSQC and 
methyl relay experiments used for auxiliary methyl assignments of He, Val and Leu 

25 residues [33-35]. Some ambiguous resonances were further confirmed by simultaneous 
15 N/ 13 C-edite-dNOESY [36]. 

B. subtilis ACP Structure Calculation. The NMR solution structure is based on 
interproton distance constraints converted from observed NOEs in both the i5 N- edited 
30 NOESY [37,38] and simultaneous l5 N/ l3 C-edited NOESY experiments [36]. The NOEs 
were classified as either strong (1.8-2.7 A), medium (1.8-3.3 A) or weak (1.8-5.5 A) 



constraints. $ and \p torsion angle constraints were obtained from 15 N, Ha, Co: and CB 
chemical shifts using the TALOS program [39]. Upper distance limits for distances 
involving methyl protons and non-stereospecifically assigned methylene protons were 
corrected appropriately for center averaging [40], and an additional 0.5 A was added to 
5 upper distance limits for NOEs involving methyl protons [41 ,42]. 

The structures were calculated using the hybrid distance geometry-dynamical 
simulated annealing method of Nilges et al. (1988) [43] with minor modifications [44] 
using the program XPLOR [45], adapted to incorporate pseudopotential secondary 
13 Co/ 13 CB chemical shift restraints [46] and a conformational database potential [47,48]. 

10 The target function that is minimized during restrained minimization and simulated 

annealing comprises only quadratic harmonic terms for covalent geometry and secondary 
13 C<y 13 CB chemical shift restraints, square-well quadratic potentials for the experimental 
distance and torsion angle restraints, and a quartic van der Waals term for non-bonded 
contacts. All peptide bonds were constrained to be planar and trans. There were no 

15 hydrogen-bonding, electrostatic or 6-12 Lennard-Jones empirical potential energy terms 
in the target function. 

The structure of B. subtilis ACP was determined from a total of 1050 distance 
constraints comprising 337 intra-residue, 231 sequential, 188 medium, and 240 long 
range distance constraints, 54 hydrogen bond constraints and 92 torsion angles 

20 constraints comprised of 46 <f> and 46 \p dihedral constraints. The hydrogen bond 

constraints were based on the observation of slow exchanging NH protons in a D 2 0 
solution monitored by an l H- 15 N HSQC spectrum. 

The final ensemble of 22 structures contained no distance constraint violations 
greater than 0.2 A and no torsion angle constraint violations greater than 2°. The NMR 

25 structures are well defined. This is evident by the atomic rms distribution of the 22 

simulated annealing structures about the mean coordinate positions where the backbone 
and all atom rms is 0.45 A and 0.93 A, respectively. For residues only in secondary 
structure regions, the backbone and all atom rms is 0.35 A and 0.84 A, respectively. The 
B. subtilis ACP NMR structure is consistent with a good quality structure based on 

30 PROCHECK and Ramachandran analysis [49,50]. A Ramachandran plot of the 

minimized average structure shows a total of 83.1% of the residues are in the most 



favored region, 12.7% in the additional allowed, and 2.8% in the generously allowed 
regions with only one residue (Vail 7) in a disallowed region (based on residues 6-81 of 
ACP). Vail 7 is located in a long loop between helices 1 and 2 that corresponds to a very 
flexible region of the protein. PROCHECK analysis indicates an overall G-factor of - 
0.23, a hydrogen bond energy of 0.9 and only 2 bad contacts. 

2. Results and Discussion 

Introduction. The biosynthesis of fatty acids consists of a series of reactions catalyzed by 
specific enzymatic activities [51]. The organization of the enzymatic activity is 
significantly different between eukaryotic cells and prokaryotic and plants cells. In 
eukaryotic cells large multifunctional enzymes exist with distinct domains associated 
with a particular function. Conversely, in prokaryotic and plant cells, the various 
enzymatic activities is associated with individual proteins that are loosely associated with 
each other. Acyl carrier protein (ACP) is a discrete small acid protein (9 KDa) in 
prokaryotic and plant cells that plays an essential role in fatty acid biosynthesis; whereas, 
ACP is a subunit of fatty acid synthetase (FAS) in animal tissue. ACP is the carrier of 
fatty acids during fatty acid biosynthesis in prokaryotic and plant cells and is responsible 
for acyl group activation [51-53]. 

A unique feature of ACP is the presence of the 4 f -phosphopantetheine (P-pant) 
prosthetic group. The P-pant moiety is attached through a phosphodiester linkage to a 
specific conserved serine residue found in all ACPs. ACP exists in both an active (holo) 
and inactive (apo) form where activation of ACP is mediated by Holo-acyl carier protein 
synthase (ACPS). ACPS transfers the P-pant moiety from CoA to Ser-36 of Apo-ACP 
(SEQ ID NO:l) to produce holo- ACP and 3',5 ? -ADP in a Mg +2 -dependent reaction. 
During biosynthesis of a long-chain fatty acid, the fatty acid chain is attached to ACP via 
a thioester linkage to the terminal cysteamine thiol of the P-pant prosthetic group where 
the fatty acid chain is then elongated by the fatty acid synthetase system. A potential 
function of the P-pant prosthetic group is to act as a tether to transfer the growing fatty 
acid chains between the various enzymes or active sites in the FAS system. 

ACP is a central component and plays a fundamental role in fatty acid and other 
biosynthetic pathways that require acyl transfer steps [54, 55-58]. The activation of ACP 
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by ACPS is critical to this function where ACPS was identified as critical for the viability 
of E. coli [59,60], Furthermore both ACP and ACPS are viable targets for a drug 
discovery program since the enzymes are essentially unique to prokaryotic cells. Since 
the activation of ACP is mediated by its interaction with ACPS, interfering with either the 
5 activity of ACPS or the binding interaction of ACPS with ACP may prove to be a 
valuable approach for developing novel antibiotics. 

NMR Data. NMR data was collected on both the apo- and holo- forms of ACP. 
While the NMR spectra indicate distinct chemical shifts for the NH proton and amide- 15 N 
resonances for residues in the vicinity of the 4'-PP prosthetic group, the NMR data 

10 indicate that the structures for apo-ACP and holo- ACP are effectively identical. 

Uniqueness of the B. subtilis ACP NMR Structure. Structures for E. coli and 
Streptomyces coelicolor A3(2) ACP were reported in the literature prior to initiation of 
our efforts on the structure determination of B. subtilis ACP [61-63]. Amino acid 
sequence alignments indicate that E. coli ACP is highly homologous to B. subtilis ACP 

15 where 53 of 76 residues are identical residue types (46) and identical residue classes (7). 
Comparison of Streptomyces coelicolor A3 (2) ACP with B. subtilis ACP indicate 38 of 
76 residues are identical residue types (17) and identical residue classes (21). The overall 
sequence homology suggests that the three proteins should have a similar global fold 
(Figure 4). 

20 Comparison of the published structures for E. coli and Streptomyces coelicolor 

A3 (2) ACP with B. subtilis ACP indicate similar secondary structure elements for the 
three proteins. The overall ACP structure consists of a four cehelical bundle where three 
Of-helices are relatively long (6-1 5 residues) and one helix is short (0-6 residues). Despite 
the similarity in the secondary structure features the global fold for the three ACP 

25 structures is distinct. This is readily apparent by the superposition of the average- 
minimized three-dimensional structures for the three proteins (not shown). The atomic 
rms deviation of the Cof trace between E. coli and B. subtilis ACP is 2.32 A. Similarly, the 
deviation of the Co; trace between Streptomyces coelicolor A3(2) and B. subtilis ACP is 
2.3 1 A. Although the previous structures of E. coli ACP and Streptomyces coelicolor 

30 A3(2) are of poor quality, the extremely large rms differences between E. coli ACP, 
Streptomyces coelicolor A3(2) and B. subtilis ACP indicate that each structure is 



relatively unique and that it would not be possible to predict the structure of B. subtilis 
ACP from the structures of E. coli ACP and Streptomyces coelicolor A3 (2). 

The observed large rms deviations between the three ACPS structures are located 
mainly in the short ohelix 3 and the long loop region between Qf-helix 1 and 2. The short 

5 a-helix 3 is not present in some models of both E. coli and Streptomyces coelicolor A3 (2) 
ACP and is not present in the average minimized Streptomyces coelicolor A3(2) ACP 
structure. Some of the observed differences between the B. subtilis ACP structure and 
both the E. coli and Streptomyces coelicolor A3 (2) ACP structures result from unusual 
features of the E. coli and Streptomyces coelicolor A3(2) ACP structures. An example of 

10 an unusual feature is the presence of a large kink in Qf-helix 1 for E. coli ACP that results 
in this helix being extremely distorted. The observation of distinct structures for the three 
ACP proteins is unexpected given the reasonable sequence homology and the obvious 
fact that the proteins are functionally identical. A potential cause for the structural 
difference may be a function of the structure determination process instead of a difference 

1 5 that may be attributed to the origin of the proteins. 

The available structural information for ACP has been obtained by NMR 
methodologies over a span of -12 years. During this time-period NMR technology has 
been vastly improved resulting in the ability to obtain high-resolution structures of 
increasingly larger proteins [64-66]. As a result, the methodology applied to determining 

20 the B. subtilis ACP structure is inherently superior to the techniques used for the E. coli 
and Streptomyces coelicolor A3 (2) ACP structures. Invariably, the precision and 
accuracy of a protein structure determined by NMR is dependent on the number and 
reliability of the structural constraints interpreted from the NMR data [64,67]. The 
inherent reliability of the interpretation of the structural constraints is dependent on the 

25 number of available constraints. This relationship exists since a given structure has to be 
consistent with all the available constraints. So, the more constraints that are available for 
determining a structure the higher the likelihood that erroneous data will be identified by 
being inconsistent with the abundance of correct data. Additionally, the nature of the 
structural constraint is critical in relationship to the accuracy of the overall structure. 

30 Intra-residue constraints convey a localized structural effect usually contributing to the 
residues torsion angles whereas long-range inter-residue constraints will determine the 



overall fold of the protein. Therefore, a protein structure with an abundance of intra- 
residue constraints and a minimal number of long-range constraints will result in a 
relatively low-resolution structure. 

The structures for E. coli and Streptomyces coelicolor A3(2) ACP were based on a 
5 minimal number of distance constraints, especially long-range distance constraints, 
relative to the B. subtilis ACP structure. E. coli ACP (SEQ ID NO: 1 5) (77 residues) 
structure was based on a total of 478 distance constraints comprising 30 H-bond distance 
constraints, 101 intra-residue distance constraints and 205 sequential, 87 short-range and 
55 long-range inter-residue constraints. The average number of distance constraints was 

10 only 6.2 constraints per residue. Similarly, the Streptomyces coelicolor A3(2) ACP (SEQ 
ID NO: 16) (86 residues) structure was based on a total of 747 distance constraints 
comprising 48 H-bond distance constraints, 240 intra-residue constraints, 235 sequential, 
131 short-range and 93 long-range distance constraints. The average number of distance 
constraints for Streptomyces coelicolor A3(2) ACP (SEQ ID NO: 16) was only 8.7 

15 constraints per residues. Conversely, our B. subtilis ACP (SEQ ID NO: 1) (76 residues) 
structure is based on a total of 1050 distance constraints with an average of 13.8 
constraints per residue. Similarly, the B. subtilis ACP structure is based on more <f>, \p 
dihedral angle constraints relative to both E. coli and Streptomyces coelicolor A3 (2) 
ACP. A total of 4> and \p dihedral angle constraints were used for the B. subtilis ACP 

20 structure compared to 54 and 63 for the E. coli and Streptomyces coelicolor A3 (2) ACP 
structures, respectively. In addition, the B. subtilis ACP structure was refined using both 
Ca/CR chemical shifts constraints and a conformational database potential which were 
not used for determining the E. coli and Streptomyces coelicolor A3 (2) ACP structures. 
In addition to the number of constraints, the quality of the ACP structures is also reflected 

25 by the rms difference between each structure in the ensemble relative to the average 

structure. Typically, a high resolution NMR structure exhibits a backbone rms of <0.5 A 
[64]. As apparent in Table 6, the structures for E. coli and Streptomyces coelicolor A3(2) 
ACP have extremely high rms values suggestive of a low to poor quality structure, 
whereas, B. subtilis ACP conforms to a rms value consistent with a high quality structure. 
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Table 6: Atomic rms Differences (A)' 



E. coli ACP Streptomyces coelicolor A3 (2) ACP b B. subtilis ACP 

All Residues 

Backbone 2.3 1.47 0.45 

All atoms 3.3 1.84 0.93 



Secondary Structure 

10 Backbone N.D. 1.01 0.35 

All atoms N.D. 1.45 0.84 

''The NMR ensemble for the E. coli, Streptomyces coelicolor A3(2) and B. subtilis ACP 
structures consist of 7, 24 and 22; respectively. b Only residues 5-86 were used for the 
rms. 

15 

There is additional evidence that indicates that the previous structural efforts 
related to ACP were problematic which may imply that the previous ACP structures may 
be inaccurate relative to the structure of B. subtilis ACP. The NMR structure for E. coli 
ACP was published in 1988, further modified in 1990 and finally released by the PDB in 

20 1993 (PDB: 1 ACP). In fact, two separate models for E. coli ACP were deposited in the 
PDB, where one model is described as "Not Energetically Ideal" and the authors suggest 
multiple conformers. Structural information for Spinach ACP and the nodulation protein 
NodF from Rhizobium leguminosarum, which shares homology with ACP were also 
published, but the NMR data was of too low a quality to determine and release a three 

25 dimensional structure [68,69]. These results clearly suggest an inherent technical 

difficulty that was encountered with the previous ACPs structures that was not a factor in 
the B. subtilis ACP structure. 

The uniqueness of the three structures and more critically the inherent value and 
accuracy of the B. subtilis ACP was also apparent from the molecular replacement efforts 

30 for solving the X-ray structure of the B. subtilis ACP- ACPS complex. It was not possible 
to solve the X-ray structure of ACP in the B. subtilis ACP-ACPS complex using the E, 



coli ACP structure. A solution to the X-ray structure of ACP in the B. subtilis ACP-ACPS 
complex was only obtained when the NMR B. subtilis ACP structure was used as part of 
a molecular replacement approach. 

Example 3 

Identification of Inhibitor Binding Sites on ACP 
Inhibitors of the ACPS conversion of apo-ACP to holo-ACP were analyzed for 
direct binding to either ACP or ACPS by NMR. Inhibitor binding to ACP was monitored 
by chemical shift perturbations in a 2D ^-^N HSQC spectra. The observation of 
chemical shift perturbations in a 2D ] H- 15 N HSQC spectra indicate both an interaction 
between ACP and the inhibitor and the location of the inhibitor binding site. The NMR 
assignments for free ACP was utilized to identify which residues have changed in the 
ACP-inhibitor complex. Further identification of the binding site was obtained by 
superimposing the perturbed residues onto the NMR structure of ACP. All of the residues 
that experience chemical shift changes in the presence of the inhibitor occur on a loop 
region corresponding to residues 53-56. This loop is spatially proximal to the conserved 
serine (S3 6) that is attached through a phosphodiester linkage to the 
4'phosphopantetheine (P-pant) prosthetic group. The identification of the location of the 
P-pant prosthetic group was determined by chemical shift differences between the apo- 
and holo- forms of ACP. The proximal location of the potential inhibitor-binding site with 
the P-pant binding site suggests two possible mechanisms for inhibition of the ACP- 
ACPS activity. The activity of the inhibitor could be attributed to disruption of the 
binding of ACP with ACPS or it may stericly prevent the addition of the P-pant prosthetic 
group to ACP from CoA. 

Specificity for the inhibitor to ACP is also monitored by its ability to bind ACPS. 
Inhibitor binding to ACPS was monitored by line- width changes in one-dimensional *H 
titration studies. An effect of the large molecular- weight difference between ACPS and a 
small molecular inhibitor is the corresponding difference in the observable NMR line- 
widths. If the small molecule binds ACPS, it will demonstrate an apparent molecular 
weight similar to ACPS resulting in a dramatic increase in the NMR line-widths for the 



32 



small molecule. Inhibitors identified to affect the ACP-ACPS activity have been shown to 
bind either ACP orACPS. 

Example 4 

Use of the ACP NMR Structure to Determine 
the Solution Structure of ACP -Inhibitor Complexes 
When an appropriate ACP inhibitor has been identified, a structure for the ACP 
complexed to the inhibitor may be determined from the following procedure. 

NMR Data Collection. NMR sample preparation and data collection and processing were 
as described in Example 2, with the addition of the inhibitor in either a molar excess or a 
1 : 1 molar ratio with ACP. 

NMR Assignments. The assignments of the l H, 15 N, and 13 C resonances of ACP in the 
ACP-inhibitor complex are based on a minimal set of experiments: 2D 'H-^N HSQC, 3D 
15 N- edited NOESY [37,38], CBCA(CO)NH [30], C(CO)NH [71], HC(CO)NH, [71], 
HNHA [72] and HNCA [73]. 

The nearly complete resonance assignments for ACP provided the starting point 
for the assignments of ACP in the new inhibitor complex. Three important observations 
facilitated these assignments and provided a simple "boot-strap" approach using a 
minimal set of NMR experiments. First, as apparent by the chemical shift perturbations in 
a 2D ] H- 15 N HSQC spectra, >90% of the ACP residues are unperturbed by the presence 
of the new inhibitor. In fact, for inhibitors that bind ACP in a similar manner the 
resonance assignments for ACP in the complex will be very comparable and greatly 
facilitate the assignment process. This indicates that the majority of the ACP structure is 
unaffected and that only residues in close proximity to the new inhibitor may incur a 
significant chemical shift change. Therefore, the backbone assignments of residues in the 
vicinity of the inhibitor may be obtained by following sequential NOE connectivities in 
the 3D 15 N-edited NOESY spectra by starting with unaffected residues sequential to 
perturbed residues. 
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Second, while significant ! H and 15 N chemical shift perturbations occur for 
residues in the vicinity of the inhibitor, the general NOE pattern may be intact. Simple 
comparison of the 3D 15 N-edited NOESY spectra of ACP and the new complex may 
readily identify the sequential and intra-residue NOEs in the ACP inhibitor spectra. This 
5 provides a straight-forward approach to side-chain ! H assignments. Third, 13 C chemical 
shifts generally do not incur any significant chemical shift perturbations even for residues 
in close proximity to the new inhibitor. 

The resonance assignments and bound conformation of the inhibitor in ACP- 
inhibitor complex are based on the 2D 12 C/ ,2 C-filtered NOESY [74,75], 2D ,2 C/ 12 C- 

10 filtered TOCSY [74,75] and 12 C/ 12 C-filtered COSY experiments [76]. The ACP-inhibitor 
NMR sample is composed of 13 C/ 15 N labeled ACP and unlabeled inhibitor. Thus, 
traditional 2D-NOESY, COSY and TOCSY spectra of the inhibitor in the presence of 
ACP were determined from 2D 12 C-filtering experiments [74-76] where only crosspeaks 
between protons attached to 12 C carbons are observed. This efficiently filters all protein 

15 resonances and allows for the straight- forward analysis of the inhibitor spectrum. 

The ACP-inhibitor structure is based on the following series of spectra: HNHA 
[72], HNHB [77], 3D long-range l3 C- 13 C correlation [78], coupled CT-HCACO [79,80], 
HACAHB-COSY [81], 3D 15 N-[37,38] and ,3 C-edited NOESY [82,83], 3D 13 C- 
edited/ 12 C-filtered NOESY [84], 2D 12 C / 12 C -filter- ed NOESY [74,75] and 15 N-edited 

20 ROESY [85]. The 15 N-edited NOESY, 13 C-edited NOESY, 2D 12 C / 12 C -filtered NOESY, 
3D 13 C-edited/ 12 C-filtered NOESY and 15 N-edited ROESY experiments were collected 
with 100 msec, 120 msec, 100 msec, 110 msec and 40 msec mixing times, respectively. 

The ACP-inhibitor structure is based on the observed intermolecular and 
intramolecular NOEs from the inhibitor observed in the 3D 15 N-edited NOESY [37,38], 

25 2D 12 C / l2 C-filtered NOESY [74,75], 3D 13 C-edited/ 12 C -filtered NOESY [84]. 

Structure Calculations. The structure calculations and distance restraints are used as 
described in Example 2 with the following modifications. The restraints used for the 
refinement of the ACP-inhibitor NMR structure are amended with the distance restraints 
30 observed between ACP and the inhibitor from the 3D 13 C-edited/ 12 C-filtered NOESY and 
3D 15 N-edited NOESY experiments and the intra-molecular restraints observed for the 



inhibitor from the 2D 12 C-filtered NOESY experiment. Additionally, the ACP NMR 
restraints are modified as appropriate for residues in the vicinity of the active site. This 
permits the structure of the ACP active site to be determined primarily by the observed 
inter-molecular NOEs between ACP and the inhibitor. Also, the ACP-inhibitor complex 
5 may be refined using the 3 J N Ha coupling constants determined from the HNHA [72] 

experiment and secondary 13 Co/ 3 CB chemical shift restraints from the assignments for the 
complex. 

Generation of the bound conformation of the inhibitor followed the general 
procedure described for ACP in Example 2 with the following modifications. The bound 

10 conformation for the inhibitor is generated using QUANTA97 and CHARMM 

(Molecular Simulations Inc., San Diego) and the XPLOR topology and parameter files is 
generated using XPLOR2D [86]. Generation of the bound conformation of the inhibitor 
follows the following general procedure. The initial inhibitor structure is created using 
the QUANTA97 2D-sketcher application and is subjected to 500 steps of CHARMM 

15 minimization. NOE restraints were created using the CHARMM distance/dihedral 

constraint option. The NOE scaling constant is set to 500 and the structure is subject to an 
additional 500 steps of CHARMM minimization. 

The starting ACP-inhibitor complex structure for the simulated-annealing protocol 
is then obtained by manually docking the bound conformation of the inhibitor into the 

20 NMR structure determined for ACP using QUANTA97. The inhibitor was then subjected 
to a 1000 steps of restrained CHARMM minimization using the inhibitor intramolecular 
and intermolecular NOE restraints while keeping coordinates for ACP fixed. This 
approach approximates the positioning of the inhibitor in the active site of ACP without 
distorting the ACP structure. The final structure is exported as a PDB file and used as the 

25 starting point for the standard XPLOR simulated annealing protocol where all residues in 
the structure are free to move. 

Example 5 
Inhibitor Design 

30 General. There are a number of computational software packages that may be used for 
the analysis of protein NMR structures. In this case, the software packages Sybyl V.6.4+ 
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to v.6.5+ from Tripos Associates and QUANTA97 (Version 97.1003) an XPLOR (Version 
3.840) from MSI were the packages used. Once the coordinates have been determined by 
NMR a number of steps may be taken as listed below: 

1. The original coordinates are read into the software package and the three- 

5 dimensional structure is analyzed graphically. In addition, programs within QUANTA 
check for the correctness of the NMR coordinates with regard to features such as bond 
and atom types. 

2. The modified (if necessary) structure is energy minimized using the 
QUANTA/CHARM until all the structural parameters are at their equilibrium/optimal 

10 values. 

3. The energy minimized structure is superimposed against the original NMR 
structure to ensure there are no significant deviations between the original and minimized 
coordinates. 

4. The protein-native ligand complex is analyzed, the interactions between the 
15 native ligand and the protein are identified. The uncomplexed structure binding site is 

compared to the complexed structure's binding site for areas which may be exploited by a 
potential inhibitor. 

5. The final protein structure bound to the inhibitor is modified by removing the 
inhibitor so only the protein and a few residues of the natural ligand are left for analysis 

20 of the binding site cavity. The natural ligand residues are docked into the uncomplexed 
structure's binding site to be used as templates for SYBYL/UNITY database searching. 

6. SYBYL/UNITY is used to create excluded volume and distance constrained 
queries for searching structural databases. Structures qualifying as 'hits' are screened for 
activity. 

25 7. Once specific inhibitor-protein interactions are determined between new 

inhibitors and the protein structure, docking studies may be carried out between the 
different series of in-house inhibitors and ACR This part gives the initial modeled 
complexes of new inhibitors with ACR 

To check for the integrity of the modeled new ACP-inhibitor complexes, different 

30 procedures may be used. In this case, constrained conformational analysis is carried out 
using molecular dynamics methods. In this modeling process, both protein and the 



complexed ligand are allowed to sample different 3D conformational states until the most 
favorable state is reached or found to exist between protein and inhibitor. The final 
structure as proposed by the molecular dynamics analysis is analyzed visually to make 
sure the modeled complex is in accord with known experimental SAR based on measured 
binding affinities. 
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